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ROBUST MODE STAGGERCASTING STORING CONTENT 

The present patent application claims priority from provisional patent 
application no. 60/443,672 filed on January 28, 2003. 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates to staggercasting methods and apparatus. 

2. Background of the invention 

Current digital television transmission standards in the United States, as 
proposed by the Advanced Television Systems Committee (ATSC) dated September 
16, 1995, incorporated by reference herein, use a single carrier modulation 
technique: eight level vestigial sideband modulation (8-VSB). Because it is a single 
carrier modulation technique, it is susceptible to signal degradation in the 
communications channel, such as fading caused by multipath and other signal 
attenuation. While some such fading may be compensated by channel equalization 
techniques, if the fade is long enough and severe enough, then the receiver will lose 
the signal and the demodulator system will lose synchronization. Reacquiring the 
signal, and resynchronizing the demodulator can take several seconds and is quite 
objectionable to a viewer. 

To overcome this problem, a first ATSC proposal permits creation of a second 
communications channel by permitting use of a more robust modulation technique for 
a limited period of time, e.g. less than 10%. For example, a 2 or 4-VSB modulation 
technique may be used for selected frames. A second ATSC proposal permits a 
more robust encoding technique, e.g. trellis encoding, while maintaining an 8-VSB 
modulation technique. Such a system permits improved performance with 
compatible receivers while maintaining backwards compatibility with existing 
receivers. 
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Another known technique for overcoming fading is staggercasting. PCT 
Application No. US02/22723 filed July 17, 2002, by K. Ramaswamy, et al., and PCT 
Application No. US02/23032 filed July 19, 2002 by J. A. Cooper, et al., incorporated 
by reference herein, disclose staggercasting communications systems. 
5 Staggercasting communications systems transmit a composite signal including two 
component content representative signals: one of which is delayed with respect to the 
other. Put another way, one of the component content representative signals is 
advanced with respect to the other. The composite signal is broadcast to one or 
more receivers through a communications channel. At a receiver, the advanced-in- 

1 0 time component content representative signal is delayed through a delay buffer so 
that it becomes resynchronized in time with the other component content 
representative signal. Under normal conditions, the undelayed received component 
content representative signal is used to reproduce the content. If, however, a signal 
fade occurs, then the previously received and advanced-in-time content 

1 5 representative signal in the delay buffer is used to reproduce the content until either 
the fade ends and the composite signal is available again, or the delay buffer 
empties. If the delay period, and the associated delay buffer, is large enough then 
most probable fades may be compensated for. 

PCT Application No. US02/22723 filed July 17, 2002, by K. Ramaswamy, et 
20 al., and PCT Application No. US02/23032 filed July 1 9, 2002 by J. A. Cooper, et 

al.also disclose a staggercasting system in which one of the component signals in the 
composite signal represents the content at a higher quality than the other component 
signal. In this arrangement, the lower quality component signal is advanced in time 
relative to the higher quality component signal. As described above, at the receiver 
25 under normal conditions, the undelayed received component, which in this case is the 
higher quality component signal, is used to reproduce the content. If, however, a 
signal fade occurs, then the previously received and advanced-in-time content 
representative signal, which in this case is the lower quality component signal, in the 
delay buffer is used to reproduce the content until either the fade ends and the 
30 composite signal is available again, or the delay buffer empties. This permits 

reproduction of a higher quality signal under normal conditions, and reproduction of a 
lower quality signal in the presence of a fade event. Because the lower quality signal 
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requires fewer bits to transmit, the overhead required to provide fade resistance is 
decreased. 

Reductions in size of storage devices have made it possible to incorporate 
such storage devices in many electronic systems. For example, televisions receiver 
set-top-boxes, for both cable and satellite reception, incorporate such storage 
devices. 

BRIEF SUMMARY OF THE INVENTION 

The inventors have realized that this concept may be expanded to a system in 
which multiple component signals, all representing the content but at differing 
qualities, are included in the composite signal. The component signal representing 
the content at the lowest quality is undelayed in the composite signal. The higher 
quality component signals are delayed with respect to the lowest quality encoded 
signal: the higher the quality of the component signal, the longer the delay. In such a 
system, when all component signals are available, then content representative 
signals of all the qualities may be reproduced. Further, some or all of the component 
signals may be encoded using a relatively robust encoding. 

The inventors have also realized that such staggercasting communications 
systems may also be adapted to operate with electronic storage devices in an 
enhanced manner. In such a system, one of the received content representative 
signals will be stored in the storage device. There are conditions where a user will 
want to specify storage of a content representative signal at a desired quality, from 
among the available content representative signals. 

In accordance with principles of the present invention, a method and 
apparatus for storing staggercasted content includes encoding a set of signals 
representing content. The set of signals is capable of being decoded to produce a 
corresponding set of decoded signals, each decoded signal having a quality different 
from the qualities of the decoded signals corresponding to the other encoded signals. 
A composite signal comprising the set of encoded signals, staggered in time, is 
generated. The set of encoded signals is extracted from the composite signal. 
Errors in the set of extracted encoded signals are detected to produce a subset of 
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available extracted encoded signals which are not erroneous. A content 
representative signal at a selectable desired quality is decoded. The decoded 
content representative signal is then stored in a storage device. 

BRIEF DES CRIPTION OF THE DRAWING 

Fig. 1 is a block diagram of a portion of a staggercasting transmitter; 

Fig. 2 is a block diagram of a portion of a staggercasting receiver; 

Fig. 3 is a packet timing diagram useful in understanding the operation of the 
staggercasting communications system illustrated in Fig. 1 and Fig. 2; 

Fig. 4 is a GOP timing diagram useful in understanding the operation of an 
enhanced staggercasting communications system; 

Fig. 5 is a block diagram of a selector which may be used in the receiver 
illustrated in Fig. 2; 

Fig. 6 is a block diagram of a portion of another embodiment of a 
staggercasting receiver; 

Fig. 7 is a video frame timing diagram useful in understanding the operation of 
the staggercasting receiver illustrated in Fig. 6; 

Fig. 8 illustrates an extended syntax and semantics for the program map table 
(PMT) and/or program and information systems protocol — virtual channel table 
(PSIP-VCT); 

Fig. 9 is a block diagram of a portion of another embodiment of a 
staggercasting transmitter for transmitting multiple resolution version of a content 
representative signal; 

Fig. 1 0 is a block diagram of a portion of another embodiment of a 
staggercasting receiver for receiving a transmitted multiple resolution version of a 
content representative signal; 
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Fig. 11 is a block diagram of a portion of a transmitter for transmitting a dual 

interlaced content representative signal- 
Fig. 12 is a block diagram of a portion of a receiver for receiving a dual 

interlaced content representative signal; and 

5 Fig. 13 is a display diagram useful in understanding the operation of the dual 

interlace transmitter illustrated in Fig. 1 1 and dual interlace receiver illustrated in Fig. 
12. 

DETAILED DESCRIPTION OF THE INVENTION 

Fig. 1 is a block diagram of a portion of a staggercasting transmitter 1 00 
1 0 according to principles of the present invention. One skilled in the art will understand 
that other elements, not shown to simplify the figure, are needed for a complete 
transmitter. One skilled in the art will further understand what those elements are 
and how to select, design, implement and interconnect those other elements with the 
illustrated elements. 

1 5 In Fig. 1 , a source (not shown) of content, which in the illustrated embodiment 

may be a video image signal, audio sound image, program data, or any combination 
of these, provides a content representative signal to an input terminal 105 of the 
transmitter 100. The input terminal 105 is coupled to respective input terminals of a 
robust mode encoder 110 and a normal mode encoder 120. An output terminal of the 

20 robust mode encoder 1 1 0 is coupled to a first input terminal of a multiplexer 1 40. An 
output terminal of the normal mode encoder 120 is coupled to an input terminal of a 
delay device 130. An output terminal of the delay device 130 is coupled to a second 
input terminal of the multiplexer 140. An output terminal of the multiplexer 140 is 
coupled to an input terminal of a modulator 150. An output terminal of the modulator 

25 150 is coupled to an output terminal 115. The output terminal 115 is coupled to a 
communications channel (not shown). 

In operation, the normal mode encoder 120 encodes the content video, audio 
and/or data using a source encoding technique. In the illustrated embodiment, the 
source encoding technique is the MPEG 2 encoding technique, although any such 
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source encoding technique may be used; The source encoding process is performed 
using predetermined parameters including resolution, frame rate, quantization level, 
etc. Further processing is performed in the normal mode encoder 1 20 to system 
encode the source encoded content representative signal. In the illustrated 
5 embodiment, the source coded content representative signal is formed into a series 
of transport packets containing the encoded video, audio and/or data. These 
transport packets are formatted according to the MPEG 2 standard, although any 
such system encoding may be used. 

The robust mode encoder 110 also encodes the content video, audio and/or 

10 data, using a source encoding technique. The source encoding technique used by 
the robust mode encoded 1 1 0 is more robust compared with the source encoding 
technique of the normal mode encoder 120. In the illustrated embodiment, the robust 
mode encoding used is a video coding technique designated MPEG AVC/H.264 
currently being developed by the Joint Video Team (JVT) of the ISO/lEC MPEG and 

15 ITU-T VCEG committees, and termed JVT coding below. However, any such source 
encoding technique may be used. For example, other source coding techniques, 
such as enhanced trellis coding, which provide robust encoding relative to the MPEG 
normal mode encoder 120, may also be used. The robust encoding process is also 
performed using predetermined parameters including resolution, frame rate, 

20 quantization level, etc., but the values of these parameters may be different for the 
robust encoding process than those for the normal encoding process. Processing is 
also performed in the robust mode encoder 1 1 0 to system encode the source 
encoded content representative signal. In the illustrated embodiment, the source 
coded content representative signal is formed into a series of transport packets, also 

25 according to the MPEG 2 standard, although, again, any such system encoding may 
be used. 

The normal mode encoded signal is delayed by the delay device 1 30 by an 
amount intended to allow the system to operate through a range of expected fade 
periods. The value of this parameter depends on the characteristics of the 
30 communications channel. For example, in an urban setting, with many buildings and 
moving objects, such a airplanes, fading is much more common and can last longer 



WO 2004/066706 



PCT/US2004/002062 



than in rural flat settings. In the illustrated embodiment, the delay may be varied from 
around 0.5 seconds to several seconds. 

Fig. 3 is a packet timing diagram useful in understanding the operation of the 
staggercasting communications system illustrated in Fig. 1 and Fig. 2. Fig. 3 
5 illustrates the system coded transport packet streams at the input terminal of the 
multiplexer 140. In Fig. 3, packets from the robust mode encoder 1 10 are 
represented by a horizontal row of squares 300, labeled using lower case letters: "a", 
"b", "c", and so forth. Packets from the normal mode encoder 120 are represented by 
a horizontal row of squares 310, labeled using numbers: "0", "1", and upper case 

1 0 letters: "A", "B", "C", and so forth. Packets labeled by the same letter contain data 
representing content from the same time. That is, packet "a" from the robust mode 
encoder 110 contains data representing content which corresponds in time to the 
content represented by the data in packet "A" from the normal mode encoder 120. 
Each packet in the normal mode and robust mode packet streams contains data in 

15 the header identifying them as belong to that packet stream. The delay device 1 30 
delays the normal mode encoder 120 packets by a time delay T a d v . That is, robust 
mode packets are advanced in time by T ac jv with respect to corresponding normal 
mode packets. In the embodiment illustrated in Fig. 3, T adv is ten packet time 
periods. This time period may vary from around 0.5 seconds to several seconds, as 

20 described above. 

The robust mode and delayed normal mode packet streams are multiplexed 
together into a composite packet stream in the multiplexer 140. The composite 
packet stream is time domain multiplexed, meaning that a single data stream carrying 
successive packets, one at a time, is produced. Additional packets containing other 

25 data, such as identification and control data (not shown), may also be multiplexed 

into the composite packet stream produced by the multiplexer 140. In addition, other 
packet streams representing other content sources (also not shown), possibly 
including both normal mode and robust mode packet streams representing one or 
more of the other content representative signals, may also be multiplexed into the 

30 composite packet stream produced by the multiplexer 1 40, all in a known manner. 
The packet streams 300 and 310 in Fig. 3 represent the component content 
representative signals in the composite packet stream. As may be seen, packet "A" 
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from the normal mode encoder, 120 is transmitted at the same time as packet "k" from 
the robust mode encoder 110. 

The composite packet stream from the multiplexer 140 is then channel coded 
for transmission over the communications channel. In the illustrated embodiment, the 
channel coding is done by modulating the composite packet stream in the modulator 
1 50. The channel coding for the normal mode packet stream is different from the 
channel coding for the robust mode packet stream. More specifically, the modulation 
applied to the robust mode packet stream is more robust than that applied to the 
normal mode packet stream. In the illustrated embodiment, when packets in the 
normal mode packet stream are modulated, the modulation is 8-VSB modulation 
according to the ATSC standard. When packets in the robust mode packet stream 
are modulated, the modulation is more robust modulation, for example 4-VSB or 2- 
VSB, as described above. 

In short, in the illustrated embodiment, the normal mode packet stream is 
source encoded using the MPEG 2 encoding technique and is channel encoded 
using 8-VSB modulation. This is fully backward compatible with the prior ATSC 
standard. Also in the illustrated embodiment, the robust mode packet stream is 
source encoded using the JVT encoding technique and is channel encoded using 4- 
VSB and/or 2-VSB modulation. One skilled in the art will understand that the new 
ATSC standard, referred to above, refers only to the channel encoding of the robust 
mode packet stream, i.e. 4-VSB and/or 2-VSB, and does not specify a source 
encoding technique. Consequently, any such source encoding technique may be 
used according to the standard, and the JVT encoding technique in the illustrated 
embodiment is one example of such source encoding for the robust mode packet 
stream. In the remainder of this application, 'normal mode packet stream' will refer to 
the packet stream which is source encoded using the MPEG 2 source encoding 
technique, system encoded into packets according to the MPEG 2 standard, and 
channel encoded using 8-VSB modulation; and 'robust mode packet stream' will refer 
to packets which are source encoded using the JVT source encoding technique, 
system encoded into packets according to the MPEG 2 standard, and channel 
encoded using 4-VSB and/or 2-VSB modulation. 



WO 2004/066706 



PCT/US2004/002062 



9 

The modulated composite signal is then supplied to the communications 
channel (not shown), which may be a wireless RF channel, or a wired channel, such 
as a cable television system. The composite signal may be degraded by the 
communications channel. For example, the signal strength of the composite signal 
may vary. In particular, the composite may fade due to multipath or other signal 
attenuation mechanisms. One or more receivers receive the possibly degraded 
composite signal from the communications channel. 

Fig. 2 is a block diagram of a portion of a staggercasting receiver 200 
according to principles of the present invention. In Fig. 2, an input terminal 205 is 
connectable to the communications channel (not shown) so that it is capable of 
receiving the modulated composite signal produced by the transmitter 100 (of Fig. 1). 
The input terminal 205 is coupled to an input terminal of a demodulator 207. An 
output terminal of the demodulator 207 is coupled to an input terminal of a 
demultiplexer 210. A first output terminal of the demultiplexer 210 is coupled to a 
selector 230. A second output terminal of the demultiplexer 210 is coupled to a delay 
device 220. An output terminal of the delay device 220 is coupled to a second Input 
terminal of the selector 230. An output terminal of the selector 230 is coupled to a 
signal input terminal of a multi-standard decoder 240. A control signal output terminal 
of the demultiplexer 210 is coupled to respective corresponding input terminals of the 
selector 230 and the multi-standard decoder 240. An output terminal of the multi- 
standard decoder 240 is coupled to an output terminal 215 The output terminal 215 
produces a content representative signal which is supplied to utilization circuitry (not 
shown) such as a television receiver with an image reproduction device to reproduce 
the image represented by the video content, a sound reproduction device to 
reproduce the sound represented by the audio content, and possibly including user 
input devices to allow a viewer to interact with the received data content. 

In operation, the demodulator 207 demodulates the received modulated signal 
using the appropriate demodulation techniques required to receive packets from 
either the normal mode packet stream (8-VSB) or robust mode packet stream (4-VSB 
and/or 2-VSB). The resulting signal is a received composite packet stream signal. 
The received composite packet stream signal is demultiplexed by the demultiplexer 
210 into respective normal mode source encoded and robust mode source encoded 
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component packet streams according to the identification data in the header of each 
received packet. The received normal mode packet stream is supplied directly to the 
selector 230. The received robust mode packet stream is passed through the delay 
device 220, which delays the received robust mode packet stream by the same time 
5 duration that, in the transmitter 100 of Fig. 1, the normal packet stream is delayed. 
Consequently, the content represented by the two packet stream signals at the input 
terminals of the selector 230 is time aligned. 

The demultiplexer 210 also produces an error signal at the control signal 
output terminal should a portion of the received composite signal be unusable. Any 

10 of several techniques may be used, for example, a signal-to-noise ratio detector or a 
bit-error rate detector. In addition, an error in the received composite signal may be 
detected by detecting missing packets. Each packet includes in its header both data 
identifying which packet stream the packet belongs to and a packet sequence 
number. If a sequence number for a packet stream is missed, then a packet is 

15 missing, and an error is detected. In this case, the packet stream from which the 
packet is missing may be noted, and only that packet stream detected as having an 
error. These or any other such detector may be used, alone or in combination. 

Although the control signal is illustrated as emanating from the demultiplexer 
210, one skilled in that art will understand that different error detectors may be 

20 require signals from different places in the receiver. Whatever arrangement is used, 
an error signal E is generated which is active when a portion of the composite signal 
is unusable. The selector 230 is conditioned to pass one of the two packet streams 
signals to the multi-standard decoder 240 in response to this error signal E. The 
multi-standard decoder 240 is conditioned to decode that packet stream signal, in a 

25 manner to be described in more detail below. 

The multi-standard decoder 240 performs both system decoding 
(depacketizing) and source decoding of whichever packet stream is supplied to it by 
the selector 230. The multi-standard decoder 240 can be configured to perform 
source decoding of the packet stream signals according to different coding standards. 
30 For example, when a normal mode encoded packet stream is received from the 

selector 230, the multi-standard decoder 240 is configured to depacketize and source 
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decode these packets according to the MPEG 2 standard and regenerate the content 
representative signal. Similarly, when a robust mode encoded packet stream is 
received from the selector 230, the multi-standard decoder 240 is configured to 
depacketize the packets according to the MPEG 2 standard and to source decode 
5 these packets according to the JVT standard, and regenerate the content 
representative signal. 

The operation of the receiver 200 of Fig. 2 may be understood by referring 
again to Fig. 3. Time to may represent the time when the receiver is turned on, or 
when a user specifies a new content source to receive. During the time, T adVf 

10 between tO and t4, robust mode packets "a" to "j" are loaded into the delay device 
220, and normal mode packets, designated "0" though "9" are received. At time t4, 
the normal mode packet "A" becomes available from the demultiplexer 210 and 
delayed robust mode packet "a" becomes available from the delay device 220. 
Under normal conditions, the error signal is not active on the error signal line E. In 

1 5 response, the selector 230 couples the normal mode packet stream to the multi- 
standard decoder 240, and the multi-standard decoder 240 begins to generate the 
content representative signal from the normal mode packets, as described above. 
This is illustrated by the cross hatching 301 in the normal mode packets "A" through 
■GV 

20 From time t1 to t2 a severe fade occurs in the communications channel and 

from time t2 to t3 the receiver recovers the modulated signal and resynchronizes to 
that signal. During this time, from t1 to t3, normal mode packets "H" to "M" and 
robust mode packets V to "w" are lost. This is indicated by the cross hatching 302 
and 303 in those packets. However, robust mode packets "h" to "m" have been 

25 previously successfully received. Because of the delay device 220, these robust 
mode packets are available at the other input to the selector 230 from time t1 to t3. 

The occurrence of the fade is detected and indicated by an active error signal 
on the error signal line E. In response to the active error signal on the error signal 
line E, the selector 230 couples the previously received robust mode packets "h" to 
30 "m" to the multi-standard decoder 240. Concurrently, the multi-standard decoder 240 
is configured to depacketize and decode robust mode packets. Consequently, from 
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time t1 to t3, packets "h" to "m" from the robust mode packet stream are decoded and 
the content representative signal remains available to the utilization circuitry (not 
shown). This is illustrated by the cross hatching 301 in the robust mode packets "h" 
through "m". 

At time t3, the fade ends and the composite signal becomes available again. 
Consequently the normal mode packets "N", "O", "P", become available. The 
disappearance of the fade is detected and indicated by an inactive error signal on the 
error signal line E. In response, the selector 230 couples the normal mode packet 
stream to the multi-standard decoder 240. Concurrently, the multi-standard decoder 
240 is configured to depacketize and decode normal mode packets and continues to 
generate the content representative signal. 

During the fade and recovery, from time t1 to t3, robust packets "r" through "w" 
were lost. Consequently, from time t6 through t7, when normal mode packets "FT 
through "W" are received, there are no corresponding robust mode packets in the 
delay device 220. During this time, there is no protection against a fade. However, 
once the delay device is refilled, fade protection becomes available again. 

As described above, the content representative signal remains available to the 
utilization circuitry (not shown) despite the occurrence of a fade from time t1 to t3. In 
addition, because of the robust source coding and channel coding (modulation) 
techniques, the robust mode packets are likely to survive more severe channel 
degradation, and thus be available when normal mode packets may not be. The 
quality of the content signal carried by the robust mode packet stream may be 
different from that in the normal mode packet stream. In particular, the quality of the 
content signal in the robust mode packet stream may be lower than that in the normal 
mode packet stream. A lower quality content signal requires fewer bits to transmit 
than a higher quality content signal, and such a robust mode packet stream will 
require a lower throughput than the normal mode packet stream. Thus, at the 
expense of a second, lower throughput packet stream, a system which will permit a 
graceful degradation in the event of a fading event is possible. 
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Afso as described above, the content signal may include video, audio and/or 
data, in particular, audio data may be carried in both the normal mode packet stream 
and the robust mode packet stream so that audio data also remains available despite 
the occurrence of a fade. The audio content signal carried by the robust mode 
packet stream may have a different quality, specifically a lower quality, than that in 
the normal mode packet stream. An audio signal at a lower quality may be carried 
by fewer bits and fewer packets, and, thus, would make relatively low requirements 
on the robust mode packet stream. This also would permit a graceful degradation in 
the event of a fade event. 

With a system described above, switching from the normal mode packet 
stream to the robust mode packet stream may occur at any time, if the robust packet 
stream carries content representative data which is identical to that in the normal 
packet stream down to the packet level, this may not present a problem. However, if 
the robust packet stream carries content representative data which is different from 
that in the normal packet stream, for example, if the content is represented at a 
different resolution, quantization level, frame rate, etc., then the viewer may notice a 
change in the reproduced image which may be objectionable. In a worse case, if a 
packet stream switch occurs in the middle of decoding a picture, then the decoding of 
that picture and other surrounding pictures may fail altogether, and the video image 
may be disrupted for a much longer period of time, until the decoder resynchronizes 
to an independently decodable picture. 

As described above, the normal mode packet stream is carried by a 
combination of source, system and channel encoding. In the illustrated embodiment, 
the source and system coding is according to the known MPEG 2 coding scheme and 
the channel encoding uses the 8-VSB modulation technique. The MPEG source 
coding scheme encodes a video image signal as a sequence of independent 
decoding segments. An independent decoding segment (IDS), also termed an 
elementary stream segment, is a segment which may be decoded accurately 
independent of any other independent decoding segment. In the MPEG standard, 
independent decoding segments include a sequence, group of pictures (GOP) and/or 
picture. These independent decoding segments are delimited in the compressed 
bitstream by unique start codes. That is, an independent decoding segment is 
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considered to be all the data beginning with a segment start code, up to but not 
including the next segment start code. Pictures in the MPEG 2 standard are either 
intra-coded (I pictures), inter-prediction (P pictures) or bi-directional prediction (B) 
pictures. I pictures are encoded without reference to any other pictures. A GOP 
includes a group of pictures encoded as a combination of I, P, and/or B pictures. In a 
closed GOP, all pictures in the GOP may be decoded without reference to pictures in 
any other GOP. The start of each GOP is clearly identified in the MPEG 2 packet 
stream. 

Also as described above, the robust mode packet stream is carried by a 
combination of source, system and channel encoding. In the illustrated embodiment, 
the source encoding is according to the JVT encoding scheme, the system encoding 
is according to the MPEG 2 standard and the channel encoding uses the 2-VSB 
and/or 4-VSB modulation techniques. Pictures coded using the JVT source coding 
standard are made up of coded slices, and a given picture may contain slices of 
different coding types. Each slice may be an intra-coded (I) slice, an inter-predictive 
(P) slice, a bi-predictive (B) slice, an SI slice in which only spatial prediction is used, 
or an SP slice which may be accurately reproduced even when different reference 
pictures are used. The JVT source coding standard also includes an instantaneous 
decoding refresh (IDR) picture. An IDR is a particular type of JVT encoded picture, 
which contains only I slices and marks the beginning of an IDS. An IDR indicates 
that the current picture, and all later encoded pictures may be decoded without 
requiring reference to previous pictures. An IDR may be encoded once for every 
predetermined number of pictures, emulating a GOP in the MPEG 2 standard. In the 
JVT source encoding scheme, independent decoding segments may be delimited by 
IDRs, which are clearly identified in the JVT packet stream. 

By imposing some constraints on the normal and robust source encoding 
schemes, a system may be developed which can switch from the normal mode 
packet stream to the robust mode packet stream while minimizing objectionable 
artifacts. If independent decoding segments are encoded to begin at identical 
content locations in both the normal (MPEG 2) and robust (JVT) packet streams, 
switches may be made between the normal and robust packet streams at 
independent decoding segment locations with minimal objectionable artifacts. In the 
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illustrated embodiment, the independent decoding segment used in the normal 
(MPEG 2) packet stream is a closed GOP and begins with an I picture. In the 
corresponding robust (JVT) packet stream, each independent decoding segment 
begins with an IDR picture. The I picture in the normal (MPEG) mode packet stream 
5 and the IDR picture in the robust (JVT) mode packet stream both encode the same 
video picture from the content source (not shown). Both source encoding schemes 
permit I DSs to be formed and delimited in other manners. For example, the MPEG 2 
source encoding scheme also permits slices to be formed to represent a picture. Any 
such manner may be used provided that I DSs are inserted into both packet streams 
1 0 at identical content locations. 

Referring again to Fig. 1 , the input terminal 105 is further coupled to an input 
terminal of a scene cut detector 1 60, illustrated in phantom. An output terminal of the 
scene cut detector 160 is coupled to respective control input terminals of the normal 
mode encoder 1 20 and the robust mode encoder 110. 

15 In operation, the scene cut detector 160 detects the occurrence of a new 

scene in the video content. In response to detection of a new scene, a control signal 
is sent to the normal mode encoder 120 and the robust mode encoder 110. Both the 
normal mode encoder 120 and the robust mode encoder 110 begin encoding a new 
independent decoding segment in response to the control signal. The normal mode 

20 encoder 120 inserts a new I picture and the robust mode encoder 110 inserts an IDR 
> picture into their respective encoded packet streams. The normal mode encoder 120 
and the robust mode encoder 110 operate to generate corresponding independent 
decoding segments having the same time durations. As described above, the 
encoded content representative signals are system coded into respective packet 

25 streams. 

The delay device 130 is set to introduce a delay equal to the independent 
decoding segment time duration. The multiplexer 140 combines the robust mode 
encoded packet stream and the delayed normal mode encoded packet stream into a 
composite packet stream. The composite packet stream is channel encoded 
30 (modulated) in an appropriate manner by the modulator 150 and supplied to the 
communications channel via the output terminal 115. 
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The operation of the transmitter in this mode of operation may be better 
understood by reference to Fig. 4. Fig. 4 illustrates the packet streams at the input 
to the multiplexer 140. In Fig. 4, a sequence of independent decoding segments 
(IDS) from the robust mode encoder 110 is illustrated as a series of rectangles 400, 
5 and a sequence of independent decoding segments from the normal mode encoder 
120 is illustrated as a series of rectangles 410. As described above, the time 
locations within the content, and the durations of the independent decoding segments 
from the robust mode encoder 1 1 0 and the normal mode encoder 1 20 are the same. 
Because the delay introduced by the delay device 130 is the same as the time 
10 duration of an IDS, IDSs from the robust mode encoder 110 align with the 
immediately preceding IDS from the normal mode encoder 120. 

At time to, which may represent a change in scene, as detected by the scene 
cut detector 1 60, the undelayed robust mode encoded IDS N begins and the 
previously delayed normal mode encoded IDS N-1 begins. Each robust mode (JVT 

15 source coded) IDS is illustrated as a series of rectangles 440 representing respective 
slices, and begins with an independent decoding refresh (IDR) picture. The IDR 
picture is followed by B, P, SI, and/or SP slices. These slices are, in turn, system 
coded into a sequence 450 of transport packets "a", "b", "c", etc. Similarly, each 
normal mode IDS (MPEG 2 source coded) is illustrated as a series of rectangles 420 

20 representing a GOP which begins with an I picture. The I picture is followed by an 
arrangement of P pictures and B pictures. These I, P and B pictures are, in turn, 
system coded into a sequence 430 of transport packets "A", "B", "C", etc. The 
illustrated arrangements are examples only, and any appropriate arrangement may 
be used. 

25 This composite signal is received by a receiver. Referring again to the 

receiver 200 in Fig. 2, at time to, the received robust mode IDS N is loaded into the 
delay device 220 during time T adv . The delay device 230 introduces the same delay 
(one IDS time period) to the received robust packet stream that in the transmitter the 
delay device 130 introduced into the normal packet stream. Consequently, the 

30 received normal packet stream and delayed robust packet stream at the input 
terminals of the selector 230 are realigned in time with respect to the content 
representative signal. 



cnnrin. 



WO 2004/066706 



PCT/US2004/002062 



17 

Under normal conditions, the selector 230 couples the normal mode packet 
stream to the multi-standard decoder 240, and the multi-standard decoder is 
conditioned to decode normal mode packets, as described in more detail above. If 
an error is detected in the composite signal or a portion of it, as described above, 
5 then switching is performed between the normal mode packet stream and the robust 
mode packet stream. In this embodiment, at the beginning of the IDS, the selector 
230 couples the robust mode packet stream to the multi-standard decoder 240, and 
the multi-standard decoder 240 is conditioned to decode robust mode packets, as 
described in more detail above. If no further errors are detected in the composite 
1 0 signal, then at the beginning of the next IDS, the selector 230 couples the normal 
mode packet stream to the multi-standard decoder 240 and the multi-standard 
decoder 240 is conditioned to decode normal mode packets again. 

In the receiver 200 in Fig. 2 switching from decoding the normal mode packet 
stream to decoding the robust mode packet stream and vice versa occurs at the 

15 beginning of an IDS. Each IDS is an independent decoding segment, beginning with 
either an I picture (normal mode) or an IDR picture (robust mode), which may be 
successfully decoded without reference to any other picture. Further, subsequent 
pictures may be decoded without reference to pictures preceding the IDS. Thus, 
decoding and display of the content representative signal may be immediately 

20 performed without objectionable artifacts caused by the switching. 

To further minimize video artifacts caused by switching from decoding a 
normal mode video packet stream to a robust mode packet stream, and vice versa, 
the image characteristics of the resulting video signal may be gradually changed 
between those of the normal mode video signal and those of the robust mode video 
25 signal when a switch occurs. This is especially desirable when the robust mode 
video stream is lower quality compared to the normal mode video stream, for 
example, if the spatial resolution, frame rate, etc. of the robust mode video stream is 
less than that of the normal mode video stream. 

Fig. 5 is a block diagram of a selector 230" which may be used in the receiver 
30 illustrated in Fig. 3. Such a selector 230" may gradually change the video 

characteristics (e.g. resolution, frame rate, etc.) of the resulting video signal between 
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those of the normal mode video signal and those of the robust mode video signal at 
the time of a switch between them. Fig. 5a is a functional diagram which illustrates 
the operation of selector 230", and Fig. 5b is a structural block diagram illustrating an 
embodiment of such a selector 230" which may be used in the receiver illustrated in 
5 Fig. 2. 

In Fig. 5a, the robust mode video signal is coupled to one end of a track 232 
and the normal mode video signal is coupled to the other end of the track 232. A 
slider 234 slides along the track 232 and generates a resulting video signal which is 
coupled to the output terminal of the selector 230". The resulting video signal is 
10 coupled to the output terminal 215 of the receiver 200 (of Fig. 2). A control input 
terminal is coupled to receive the error signal E from the demultiplexer 210. The 
control input terminal is coupled to an input terminal of a controller circuit 231 . The 
position of the slider 234 along the track 232 is controlled by the controller circuit 231 , 
as indicated in phantom. 

1 5 In operation, when the slider 234 is at the upper end of the track 232, then a 

resulting video signal having the characteristics (e.g. resolution, frame rate, etc.) of 
the robust mode video signal is coupled to the output terminal of the selector 230". 
When the slider 234 is at the lower end of the track 232, then a resulting video signal 
having the characteristics of the normal mode video signal is coupled to the output 

20 terminal of the selector 230". As the slider 234 moves between the upper end and 
the lower end of the track 232, then the characteristics of the resulting video signal at 
the output terminal of the selector 230" is adjusted to be between those of the normal 
mode and robust mode video signals. The closer the slider 234 is to the upper end of 
the track 232, the closer the characteristics of the resulting video signal are those of 

25 the robust mode video signal than to those of the normal mode video signal. The 

closer the slider 234 is to the lower end of the track 232, the closer the characteristics 
of the resulting video signal are those of the normal mode video signal than to those 
of the robust mode video signal. 



30 



The value of the error signal E indicates when a switch is to occur, as 
described above. When a switch occurs from one video signal (e.g. the normal mode 
or robust mode video signal) to the other video signal, for a time interval of one or 
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more video pictures around the time when the switch occurs, the slider 234 is 
gradually moved from one end of the track 232 to the other. For example, during a 
switch from the normal mode video signal to the robust mode video signal, the slider 
234 begins at the bottom of the track. For several video pictures before the switch, 
5 the slider gradually moves from the bottom of the track 232 to the top. At the time of 
the switch from the normal mode packet stream to the robust mode packet stream, 
the slider is at the top of the track 232. Consequently, the characteristics of the 
resulting video signal gradually change from those of the normal video signal to those 
of the robust mode video signal during several video pictures before the switch to the 

10 robust mode packet stream occurs. Similarly, at the time of the switch from the 

robust mode packet stream to the normal mode packet stream, the slider is at the top 
of the track 232. For several video pictures after the switch, the slider gradually 
moves from the top of the track 232 to the bottom. Consequently, the characteristics 
of the resulting video signal gradually change from those of the robust video signal to 

15 those of the normal mode video signal during several video pictures after the switch 
to the normal mode packet stream occurs. 

In Fig. 5b, the video signal from the multi-standard decoder 240 (of Fig. 2) is 
coupled to a first input terminal of a variable video quality filter 236 and a first input 
terminal of a selector 238. An output terminal of the video quality filter 236 is coupled 

20 to a second input terminal of the selector 238. An output terminal of the selector 238 
generates the resulting video signal and is coupled to the output terminal 215 (of Fig. 
2). The error signal E from the demultiplexer 21 0 is coupled to a controller circuit 
231. A first output terminal of the controller circuit 231 is coupled to a control input 
terminal of the video quality filter 236 and a second output terminal of the controller 

25 circuit 231 is coupled to a control input terminal of the selector 238. 

In operation, the video characteristics of the decoded video signal is varied by 
the video quality filter 236 in response to the control signal from the controller circuit 
231 . The control signal from the controller circuit 231 conditions the video quality 
filter 236 to produce a video signal having a range of video characteristics between 
30 those of the normal mode video signal and those of the robust mode video signal. 
Under normal conditions, when no switching occurs, the controller circuit 231 
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conditions the selector 238 to couple the decoder video signal to the output terminal 
as the resulting video signal. 

In response to a change in the value of the error signal E, indicating a switch 
between the normal mode video signal and the robust mode video signal as 
5 described above, for a time interval near the switch time the controller circuit 231 

conditions the selector 238 to couple the video signal from the video quality filter 236 
to the output terminal and conditions the quality filter 236 to gradually change the 
video characteristics of the resulting video signal. More specifically, if a switch from 
the normal mode video signal to the robust mode video signal occurs, for a time 

1 0 interval of several video pictures before the switch occurs the video quality filter 236 
is conditioned to gradually change the video characteristics of the resulting video 
signal from those of the normal video signal to those of the robust video signal. At 
the beginning of that time interval, the selector 238 is conditioned to couple the 
filtered video signal to the output terminal as the resulting video signal. When that 

15 time interval is complete, and the decoded video signal is derived from the robust 
mode packet stream, the selector 238 is conditioned to couple the decoded video 
signal to the output terminal as the resulting video signal. Similarly, if a switch from 
the robust mode video signal to the normal mode video signal occurs, for a time 
interval of several video pictures after the switch occurs the video quality filter 236 is 

20 conditioned to gradually change the video characteristics of the resulting video signal 
from those of the robust video signal to those of the normal video signal. At the 
beginning of that time interval, the selector 238 is conditioned to couple the filtered 
video signal to the output terminal as the resulting video signal. When that time 
interval is complete, and the decoded video signal is derived from the normal mode 

25 packet stream, the selector 238 is conditioned to couple the decoded video signal to 
the output terminal as the resulting video signal. 

Abrupt switching between video signals having different video quality 
(resolution, frame rate, etc.) may cause artifacts which may be objectionable to a 
viewer. Because the video quality of the resulting video signal is gradually reduced 
30 before switching from the normal mode video signal to the robust mode video signal 
and gradually increased after switching from the robust mode video signal to the 
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normal mode video signal, objectionable artifacts resulting from the switch are 
minimized. 

Another embodiment of a staggercasting communications system may also 
provide switching while minimizing objectionable artifacts and does not require any 
5 special placement of IDSs in the normal and robust mode packet streams. A receiver 
200' is illustrated in Fig. 6. In Fig. 6, elements which are similar to those in the 
receiver 200 in Fig. 2 are designated by the same reference number and are not 
described in detail below. In Fig. 6, the first output terminal of the demultiplexer 210 
is coupled to the input terminal of the normal mode decoder 240'. A first output 

10 terminal of the normal mode decoder 240' is coupled to the first input terminal of the 
selector 230' and a second output terminal of the normal mode decoder 240' is 
coupled to a first input terminal of a normal mode frame store 250'. The output 
terminal of the delay device 220 is coupled to the input terminal of the robust mode 
decoder 240". A first output terminal of the robust mode decoder 240" is coupled to 

15 the second input terminal of the selector 230' and a second output terminal of the 

robust mode decoder 240" is coupled to a first input terminal of a robust mode frame 
store 250". The output terminal of the selector 230' is coupled to respective second 
input terminals of the normal mode frame store 250' and the robust mode frame store 
250". An output terminal of the normal mode frame store 250' is coupled to a second 

20 input terminal of the normal mode decoder 240' and an output terminal of the robust 
mode frame store 250" is coupled to a second input terminal of the robust mode 
decoder 240". 

In operation, the delay device 220 introduces the same delay into the robust 
mode packet stream that the delay device 130 in the transmitter 100 (of Fig; 1) 
25 introduces into the normal mode packet stream. Consequently, the packet stream 
signals at the respective input terminals of the normal mode decoder 240' and the 
robust mode decoder 240" are time aligned with respectto the content representative 
signal. 

Both the normal and the delayed robust mode packet streams are system and 
30 source decoded to produce corresponding content representative signal streams, as 
described in detail above. In the illustrated embodiment, these content 
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representative signal streams are respective sequences of video pictures. In both 
normal mode decoding and robust mode decoding, video data representing 
surrounding pictures are required to decode predictive pictures or slices. The normal 
mode frame store 250' holds these surrounding pictures for the normal mode decoder 
240' and the robust mode frame store 250" holds these surrounding pictures for the 
robust mode decoder 250". 

In the receiver illustrated in Fig. 6, switching is performed on a picture-by- 
picture basis rather than on an IDS basis. The normal mode decoder 240' decodes 
normal mode packets into an associated content representative signal containing 
successive video pictures. Concurrently, the robust mode decoder 240" decodes 
robust mode packets into an associated content representative signal containing 
successive video pictures. As described above, the demultiplexer 210 produces an 
error signal on the error signal line E indicating that the composite signal from the 
demodulator 207, or at least some portion of it, is unusable. In the embodiment 
illustrated in Fig. 6, this error signal may be generated by detecting missing packets 
in the demultiplexed packet streams. Thus, the error signal on the error signal line E 
indicates not only that a packet is missing but also which packet stream is missing a 
packet. Because the packets carry in the payload a portion of the data forming a 
video picture carried by the packet stream, and carry data in the header identifying 
the packet stream to which this packet belongs, the packet stream which is missing a 
packet may be marked as erroneous. 

A video picture may be successfully received in both the normal and robust 
mode packet streams; may be successfully received in the normal mode packet 
stream but erroneously received in the robust mode packet stream; may be 
erroneously received in the normal packet stream but successfully received in the 
robust packet stream; or may be erroneously received in both the normal and robust 
mode packet streams. 

Under normal conditions, that is, when no error is detected in either the normal 
mode nor the robust mode packet streams, both the normal mode decoder 240' and 
the robust mode decoder 240" successfully decode the corresponding video picture. 
The selector 230' couples the content representative video picture derived from the 
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normal mode decoder240' to the output terminal 215. Also, under norma! conditions, 
the normal mode decoder 240' supplies video pictures to the normal mode frame 
store 250' and the robust mode encoder 240" supplies video pictures to the robust 
mode frame store 250". 

5 If an error is detected in the robust mode packet stream but no error is 

detected in the normal mode packet stream, then only the normal mode decoder 240' 
successfully decodes the corresponding video picture. The selector 230' couples the 
content representative video picture derived from the norma! mode decoder 240' to 
the output terminal 215. Also, the normal mode decoder 240' supplies the decoded 
10 video picture to the normal mode frame store 250\ However, because the robust 
mode decoder 240" did not successfully decode the corresponding video picture, it 
doesn't supply any video picture to the robust mode frame store 250". Instead, the 
successfully decoded video picture from the normal mode decoder 240' is routed ' 
from the selector 230' to the robust mode frame store 250". 

15 If an error is detected in the normal mode packet stream but no error is 

detected in the robust mode packet stream, then only the robust mode decoder 240" 
successfully decodes the corresponding video picture. The selector 230' couples the 
content representative video picture derived from the robust mode decoder 240" to 
the output terminal 215. Also, the robust mode decoder 240" supplies the decoded 

20 video picture to the robust mode frame store 250". However, because the normal 
mode decoder 240' did not successfully decode the corresponding video picture, it 
doesn't supply any video picture to the normal mode frame store 250'. Instead, the 
successfully decoded video picture from the robust mode decoder 240" is routed from 
the selector 230' to the robust mode frame store 250'. 

25 In the above two cases, the video picture stored in the frame store associated 

with the decoder which did not successfully decode that video picture is the video 
picture from the other decoder. This may degrade subsequent decoding compared to 
what it would be if the correct video picture were stored in the frame store. This is 
especially true should the substituted video picture be of lower quality than the 

30 erroneous video picture. However, the accuracy of subsequent decoding is better 
than if no video picture at all were stored in the frame store. 
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Should an error be detected in a video picture in both the normal mode and 
robust mode packet stream then no accurate video picture is decoded and other 
masking techniques must be performed. 

The operation of the receiver 200' illustrated in Fig. 6 may be better 
understood by reference to Fig. 7. In Fig. 7, a top set of rectangles (MPEG) 
respectively represent the input 420 and output 520 of the normal mode decoder 
240'; a middle set of rectangles (JVT) respectively represent the input 440 and output 
540 of the robust mode decoder 240"; and the bottom set of rectangles (OUTPUT) 
respectively represent the video pictures 460 and their source 560 at the output 
terminal 215. Referring to the MPEG decoding: the upper set of rectangles 420 
represent the source coded video pictures (I, P, and/or B) at the input terminal of the 
normal mode decoder 240'. The lower set of rectangles 520 represent the resulting 
video pictures at the output terminal of the normal mode decoder 240'. Similarly, 
referring to the JVT decoding: the upper set of rectangles 440 represent the source 
coded IDR picture (which may include a plurality of only I slices) and the following 
source coded video slices (I, P, B, SI and/or SP) at the input terminal of the robust 
mode decoder 240". The lower set of rectangles 540 represent the resulting video 
pictures at the output terminal of the robust mode decoder 240". Referring to the 
output terminal 215, the upper set of rectangles 460 represent the output video 
pictures and the lower set of rectangles 560 represent the source of that particular 
video picture. 

More specifically, in the normal mode (MPEG) packet stream, the video 
pictures 6, 10 and 13 are each missing at least one packet, as indicated by 
crosshatching. Similarly, in the robust mode (JVT) packet stream, the video pictures 
7 and 10 are missing at least one packet, as indicated by the crosshatching. All the 
other video pictures for both the normal mode and robust mode packet streams 
include all packets and may be successfully decoded. 

For video pictures 0-5, 8, 9, 1 1, 12 and 14, the selector 230' couples the video 
pictures derived from the normal mode decoder 240' (MPEG) to the output terminal 
215, as indicated by "M" in Fig. 7. In addition, for these video pictures, the video 
pictures from the normal mode decoder 240' are supplied to the normal mode frame 
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store 250' and the video pictures from the robust mode decoder 240" are supplied to 
the robust mode frame store 250". 

For pictures 6 and 13, the video pictures in the normal mode packet stream 
are erroneous but the corresponding video pictures in the robust mode packet stream 
are complete and available. For these pictures, the selector 230' couples the video 
picture from the robust mode decoder 240" (JVT) to the output terminal 215, as 
indicated by "J" in Fig. 7. Because for these pictures there is no normal mode video 
picture, the robust mode video picture from the robust mode decoder 240" is coupled 
to both the robust mode frame store 250" and the normal mode frame store 250'. 

For picture 7, the video picture in the normal mode packet stream is complete 
but the corresponding video picture in the robust mode packet stream is erroneous. 
For this picture, the selector 230' couples the video picture from the normal mode 
decoder 240' to the output terminal 215, as indicated by "M" in Fig. 7. Because for 
this picture there is no robust mode video picture, the normal mode video picture from 
the normal mode decoder 240' is coupled to both the normal mode frame store 250' 
and the robust mode frame store 250". 

For picture 10, the video picture in both the normal mode and robust mode 
packet streams is erroneous. Because there is no valid video picture, some form of 
error masking may be used. This is indicated by an "XX" in Fig. 7. Because there is 
no valid video picture from either the normal mode decoder 240' or the robust mode 
decoder 240", no decoded video picture may be stored in either the normal mode 
frame store 250' or the robust mode frame store 250". The data stored in the frame 
stores 250' and 250" may also be derived from some form of error masking. 

By decoding both packet streams into streams of video pictures, and switching 
from one video stream to the other at the beginning of each video picture, video 
artifacts resulting from failure to property decode a packet stream may be minimized. 
Switching with a gradual change of video quality, as illustrated in Fig. 5 may be used 
in a receiver as illustrated in Fig. 6. However, because in the receiver of Fig. 6 
switching occurs at each picture, artifacts from such switching are not as 
objectionable as when switching occurs at IDS boundaries, as in Fig. 2. 
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Degraded channel conditions may, however, result in frequent switches 
between normal mode and robust mode packet streams. This frequent switching 
may result in artifacts which may be objectionable to a viewer. This is especially true 
if the video quality of the robust mode video signal is substantially different from that 
5 of the normal mode video signal. 

In order to minimize artifacts caused by over-frequent switching between the 
normal mode packet stream and the robust mode packet stream, the selector 230 (of 
Fig. 2) and 230' (of Fig. 6) is configured to restrict switching at more often than a 
predetermined frequency. More specifically, the selector 230 or 230' may monitor the 
10 frequency at which switching is desired, and compare it to a predetermined threshold. 
If the frequency of desired switching is over the threshold, then the frequency at 
which actual switching occurs is restricted to below some maximum frequency. This 
is a form of switching hysteresis. 

For example, assume that the normal mode packet stream carries a video 
15 signal of high quality (e.g. high definition (HD)) and the robust mode packet stream 
carries a video signal of lower quality (e.g. standard definition (SD)). When the 
normal mode HD packet stream is unavailable, then the robust mode SD packet 
stream is processed to generate the image. Upscaling an SD video signal for display 
on an HD display device generates a video image of poor quality. If the normal mode 
20 packet stream is fading in and out frequently, but the robust mode packet stream 

remains available, then frequent switches between the normal mode HD video signal 
and the robust mode SD video signal occur. Frequent switches between HD and SD 
packet streams, with frequent switches between high quality and low quality images, 
produce artifacts which are objectionable to a viewer. 

25 Continuing the example, if the error signal E indicates that switching should 

occur (i.e. normal mode packets are missing) e.g. more than two times per minute, 
then actual switching is restricted to minimize the switching artifacts described above. 
In this example, under these conditions the selector 230 or 230' selects the robust 
mode packet stream for e.g. at least one minute for every switch. This will decrease 

30 the number of switches and, thus, minimize the visible artifacts resulting from those 
switches. One skilled in the art will understand that this is only one embodiment 
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implementing switching hysteresis. The thresholds for the maximum switching 
frequency to invoke hysteresis and for the restricted switching frequency may be 
made different than those of the example. Such thresholds may be determined 
empirically to find those which minimize objectionable visible artifacts. Further, the 
5 thresholds may be dynamically varied during the operation of the receiver. Finally, 
other hysteresis algorithms may be developed to restrict switching in the presence of 
conditions which would normally result in excessive switching. 

Referring again to Fig. 3 and Fig. 4, at the beginning of any broadcast or 
channel change, there is a period designated T adv during which the normal mode 
10 packets (310, 410) are filling the delay device 220 (of Fig. 2 and Fig. 6). In the 

receivers illustrated in Fig. 2 and Fig. 6, only after the delay circuit 220 is full does 
the receiver begin operation. However, this causes undue delay when a receiver is 
switched on or a channel is changed. During the time interval Tadv, however, the 
robust mode packet stream (300, 400) is immediately available. 

15 In Fig. 2, the undelayed robust mode packet stream is coupled directly from 

the demultiplexer 210 to a third input terminal of the selector 230, as illustrated in 
phantom. When the receiver is powered on or a new channel is selected, the 
selector 230 couples the undelayed robust mode packet stream to the multi-standard 
decoder 240. The multi-standard decoder 240 is conditioned to depacketize and 

20 decode the robust mode packets, as described in detail above, and a video signal is 
made immediately available to the utilization circuitry at output terminal 215. When 
the normal mode packet stream becomes available, then the selector 230 will couple 
the normal mode packet stream signal to the multi-standard decoder 240. 

In Fig. 6, the undelayed robust mode packet stream is coupled directly from 
25 the demultiplexer 210 to the robust mode decoder 240". When the receiver is 
powered on or a new channel is selected, the robust mode decoder 240" is 
conditioned to depacketize and decode the robust mode packet stream from the 
demultiplexer 210 and generate a robust mode video signal, as described in more 
detail above. The selector 230' is conditioned to couple the robust mode video signal 
30 from the robust mode decoder 240" to the utilization circuitry via the output terminal 
215. When the normal mode packet stream becomes available, then the normal 
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mode decode 240' depacketizes and decodes it and produces a normal mode video 
signal. The selector 230' is conditioned to couple the normal mode video signal to 
the utilization circuitry via the output terminal 215. 

In either case, data in the normal mode and robust mode packet streams are 
5 analyzed to determine when the normal mode packet stream has become available 
and normal operation of the receiver may be commenced. In accordance with known 
MPEG 2 system (transport packet) encoding, information related to the system time 
clock (STC) in the transmitter is placed in the encoded packet streams via program 
clock reference (PCR) data. Further information, termed a presentation time stamp 

10 (PTS), which indicates when a portion (termed an access unit) of a packet stream 
must be decoded, is included at least at the beginning of each such access unit. 
When the normal mode and robust mode packet streams are depacketized and 
decoded by the multi-standard decoder 240 (Fig. 2) or the normal mode decoder 
240' and the robust mode decoder 240" (Fig. 6), the system time clock (STC) in the 

1 5 receiver is synchronized to that in the transmitter through the PCR data. When the 
value of the PTS in the normal mode packet stream is equal to the value of the 
receiver STC, this indicates that the normal mode packet stream is in synchronism 
with the robust mode packet stream, and the receiver may begin normal operation by 
decoding the normal mode packet stream, as described above. 

20 Because many content representative signals may be transmitted on one 

multiplexed transport packet stream, a known means for supplying information about 
the different packet streams has been developed. Each packet stream is identified 
by a packet identifier (PID), which is included in the header of each packet in that 
packet stream. One packet stream, having a predetermined known PID, contains 

25 one or more data tables containing identification and other information about all the 
other packet streams. This known table structure may be used to carry information 
about robust mode packet streams which are not related to any other normal mode 
packet stream. However, additional information must be sent from the transmitter to 
the receivers about robust packet streams which are related to other normal mode 

30 packet streams. 
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An extended syntax and semantics for these existing tables may carry the 
necessary data. Fig. 8 is a table which illustrates an extended syntax and semantics 
for the program map table (PMT) and/or program and information systems protocol 
— virtual channel table (PSIP-VCT). Each row in Fig. 8 represents either a data item 
in the extended table, or a meta-syntactical description in pseudo-code form. The 
first column is either a name of a data item or a meta-syntactical specification. The 
second column is a description of the data item or syntactical specification. The third 
column is an indication of the size of any data item. 

The first item 802 in the extended syntax is the number of robust packet 
streams used to staggercast other normal mode packet streams. Then information 
for each such staggercast robust mode packet stream is included in the table, as 
indicated by the meta-syntactic specification in the next row and the last row of the 
table. Some such information is required for every robust mode packet stream. 5 For 
example, data 804 represents the program identifier (PID) for the robust mode packet 
stream; data 806 represents the type of data being carried by that packet stream; 
data 808 represents the PID of the normal mode packet stream associated with this 
packet stream; and data 810 represents the delay being introduced into the normal 
mode packet stream by the delay device 130 in the transmitter 100 (of Fig. 1). 

Some such information, however, relates to robust mode packet streams only 
of a particular data type. For example, if the robust mode packet stream carries 
video data, then information 812 related to the compression format, frame rate, 
interlace format, horizontal and vertical resolution, and bit rate is sent from the 
transmitter to the receivers so that the video image represented by the robust mode 
packet stream may be properly decoded and displayed. Similarly, if the robust mode 
packet stream carries audio data, the information 814 related to the compression 
format, bit rate, sample rate; and audio mode (surround, stereo, or mono) is sent from 
the transmitter to the receivers so that the sound represented by the robust mode 
packet stream may be properly decoded and reproduced. 

One other piece of data relates to the relative quality of the content 
representative signal carried by the robust mode packet stream. As described above, 
the quality of the content representative signal carried by the robust mode packet 
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stream may be different from that of the normal mode packet stream with which it is 
associated. In the examples described above, the quality of content representative 
signal carried by the robust mode packet is specified to be lower than that of the 
associated normal mode packet stream. However, under some conditions, the 
5 provider may transmit a higher quality signal on the robust mode packet stream. In 
this condition, it is preferred that receivers use the content representative signal 
carried by the robust mode packet stream instead of the associated normal mode 
packet stream. This is indicated to the receivers by the data 816. 

By providing information associating robust mode packet streams to normal 
1 0 mode packet streams, a receiver 200 (of Fig. 2) or 200' (of Fig. 6) may find both the 
normal mode and robust mode packet streams in the multiplexed packet stream, and 
concurrently process both of them as described above. Prior receivers which do not 
include the capabilities of the receivers of Fig. 2 and Fig. 6 will ignore this 
information and process the normal mode packet stream in the known manner. 

15 As described above, the delay introduced between the robust mode packet 

stream and the associated normal mode packet stream by the delay device 130 in 
the transmitter 100 (of Fig. 1 ) is transmitted as the data 81 0 in the table illustrated in 
Fig. 8. This permits the transmitter to change the delay period and permits the 
receiver to adjust its delay period accordingly. For example, under some channel 

20 conditions fading may be more likely than others, or the characteristics of the fading 
may change (i.e. the fades may be longer). Under such conditions, the delay period 
may be increased. The length of the delay is transmitted to the receivers, which will 
adapt the delay devices 220 (in Fig. 2 and Fig. 6) to the same delay period. Other 
conditions may also require differing delay periods. 

25 The staggercasting concept described above may be expanded. Multiple 

versions of the same content representative signal, encoded into video signals having 
different video quality (e.g. resolution, frame rate, etc.), may be staggercasted. Fig. 
9 is a block diagram of a portion of another embodiment of a staggercasting 
transmitter for transmitting multiple versions of a content representative signal. In 

30 Fig. 9 those elements which are the same as those in the transmitter illustrated in 
Fig. 1 are designated by the same reference number and are not described in detail 
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below. Fig. 10 is a block diagram of a portion of a corresponding embodiment of a 
staggercasting receiver. In Fig. 10, those elements which are the same as those in 
the receiver illustrated in Fig. 2 are designated by the same reference number and 
are not described in detail below. 

5 In Fig. 9a, input terminal 105 is coupled to an input terminal of a hierarchical 

encoder 160. Hierarchical encoder 160 source encodes and packetizes a plurality of 
output packet stream signals. A first one (0) of the plurality of output packet stream 
signals is coupled to a corresponding input terminal of the multiplexer 140'. The 
remainder of the plurality of output packet stream signals, (1) to (n) are coupled to 

10 respective input terminals of a corresponding plurality of delay devices 130(1) to 

130(n). The delay period introduced by the delay device 130(2) is greater than that 
introduced by delay device 130(1); the delay period introduced by the delay device 
130(3) (not shown) is greater than that introduced by delay device 130(2); and so 
forth. The delays may be specified in terms of packets, as illustrated in Fig. 3; 

15 independent decoding segments, as illustrated in Fig. 4; or video picture periods, as 
illustrated in Fig. 7. Respective output terminals of the plurality of delay devices 
130(1) to 130(n) are coupled to corresponding input terminals of the multiplexer 140'. 

In operation, the first packet stream signal (0) carries a base video signal 
source encoded at a lowest video quality. The second packet stream signal (1) 

20 carries extra video information. This extra video information, when combined with the 
base video signal (0) produces a video signal with a higher video quality than that of 
the base video signal (0) alone. The third packet stream signal (2) carries further 
extra video information. The video information in this signal, when combined with the 
base video signal (0) and the video information in the second packet stream signal 

25 (1 ) produces a video signal with a higher video quality than that of the combination of 
the base signal (0) and the second signal (1). The video information in the additional 
packet stream signals, up to packet stream signal (n) from the hierarchical encoder 
1 60, may be combined to produce video signals of higher video quality. The 
multiplexed signal is channel encoded (modulated) and supplied to receivers via 

30 output terminal 115. 
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Fig. 10a is the receiver corresponding to the transmitter illustrated in Fig. 9a. 
The demultiplexer 21 0 extracts a plurality (0) to (n) of packet streams. Packet stream 
(n) is coupled to a corresponding input terminal of a hierarchical decoder 260. The 
remainder (0) to (n-1) (not shown) of the plurality of packet streams are coupled to 
5 respective input terminals of a corresponding plurality 220 of delay devices. The 
plurality 220 of delay devices are conditioned to realign all of the plurality (0) to (n) of 
packet streams in time at the input terminals of the hierarchical decoder 260. The 
error signal on signal line E from the demultiplexer 21 0 is coupled to a control input 
terminal of the hierarchical decoder 260. An output terminal of the hierarchical 
1 0 decoder 260 is coupled to the output terminal 215. 

In operation, the demodulator 207 channel decodes (demodulates) the 
received signal as appropriate, as described in more detail above. The multiplexer 
21 0 extracts the plurality, (0) to (n), of packet streams carrying the hierarchy of video 
information corresponding to the packet streams (0) to (n) illustrated in Fig. 9a. 

1 5 These packet streams are aligned in time by the plurality 220 of delay devices. The 
error signal from the demultiplexer 210 indicates which packet streams are 
unavailable, e.g. missing packets. The plurality of packet streams are depacketized 
and the highest quality video image which may be hierarchically decoded from the 
available packet streams is produced by the hierarchical decoder 260. That is, . if a 

20 fading event has made all but the packet stream (0) carrying the base video signal 
unavailable, then the hierarchical decoder 260 depacketizes and decodes only the 
packet stream (0). If the packet stream (1 ) is also available, then the hierarchical 
decoder 260 depacketizes and decodes both the packet stream (0) and the packet 
stream (1 ) and generates a video signal of higher quality, and so forth. If all packet 

25 streams (0) to (n) are available, then the hierarchical decoder 260 depacketizes and 
decodes them all and generates a video signal of the highest video quality. 

In Fig. 9b, the input terminal 105 is coupled to respective input terminals of a 
plurality 170 of video encoders. The output terminal of a first one 170(0) of the 
plurality 170 of video encoders is coupled to a corresponding input terminal of the 
30 multiplexer 140'. The output terminals of the remainder, 170(1) to 170(n), of the 

plurality 170 of video encoders are coupled to respective input terminals of a plurality 
of delay devices 130(1) to 130(n). The delay period introduced by the delay device 
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130(2) is greater than that introduced by delay device 130(1); the delay period 
introduced by the delay device 130(3) (not shown) is greater than that introduced by 
delay device 130(2); and so forth. The delays may be specified in terms of packets, 
as illustrated in Fig. 3; independent decoder segments, as illustrated in Fig. 4; or 
5 video frame periods, as illustrated in Fig. 7. Respective output terminals of the 
plurality of delay devices are coupled to corresponding input terminals of the 
multiplexer 140'. 

In operation, the first encoder 170(0) source encodes the content 
representative signal and system encodes (packetizes) the resulting source encoded 

1 0 signal to generate a packet stream carrying information representing a video signal at 
lowest quality: in the illustrated embodiment, a quarter-common-interface-format 
(QCIF) video signal. The second encoder 170(1) similarly generates a packet stream 
carrying information representing a video signal at a higher quality than that produced 
by the first encoder 170(0): in the illustrated embodiment, a common-interface-format 

15 (CIF) video signal. Other video encoders, not shown, similarly generate packet 
streams carrying video signals at successively higher video quality. An SD video 
encoder 170(n-1) similarly generates a packet stream carrying an SD quality video 
signal and an HD video encoder 1 70(n) similarly generates a packet stream carrying 
an HD quality video signal. These packet streams are multiplexed by the multiplexer 

20 140' then channel encoded (modulated) and transmitted to the receivers via the 
output terminal 1 1 5. 

Fig. 1 0b is the receiver corresponding to the transmitter illustrated in Fig. 9b. 
In Fig. 10b, the demultiplexer 210 extracts a plurality (0) to (n) of packet streams. 
The packet stream (n) is coupled to an input terminal of a HD decoder 270(n). The 

25 remainder of the packet streams (0) to (n-1 ) are coupled to respective input terminals 
of a plurality 220 of delay devices. Respective output terminals of the plurality 220 of 
delay devices are coupled to corresponding input terminals of a plurality 270 of video 
decoders. Respective output terminals of the plurality 270 of video decoders are 
coupled to corresponding input terminals of a selector. The error signal on the error 

30 signal line E from the demultiplexer 21 0 is coupled to a control input terminal of the 
selector 280. 



SDOCID: <WO__2004066706A2J_> 



WO 2004/066706 



PCT/US2004/002062 



34 

In operation, the demodulator 207 channel decodes (demodulates) the 
received composite signal as appropriate, as described in more detail above. The 
demultiplexer 210 extracts the packet streams (0) to (n) corresponding to those 
generated by the plurality 170 of video encoders illustrated in Fig. 9b. The plurality 
5 220 of delay devices realigns all these packet streams (0) to (n) in time at the 

respective input terminals of the plurality 270 of video decoders. Each packet stream 
is coupled to the video decoder appropriate for decoding the video signal carried by 
that packet stream. For example, the packet stream carrying the QCIF quality video 
signal is coupled to the QCIF decoder 270(0); the packet stream carrying the CIF 

1 0 quality video signal is coupled to the CIF decoder 270(1 ) and so forth. Each video 
decoder in the plurality 270 of video decoders depacketizes and source decodes the 
signal supplied to it to generate a video signal. The error signal E from the 
demultiplexer 210 indicates which of the packet streams (0) to (n) is unavailable due 
to errors (e.g. missing packets). The selector 280 is conditioned to couple the 

15 highest quality video signal produced from available packet streams to the output 
terminal 215. 

One skilled in the art will understand that image scaling may be required for 
some of the lower quality video image signals in the transmitter systems illustrated in 
Fig. 9. The encoders, either the hierarchical encoder 160 of Fig. 9a or the plurality 
20 170 of encoders of Fig. 9b, include any such image scaling circuitry which is 
necessary it is not shown to simply the figure. 

For the communications system illustrated in Fig. 9 and Fig. 10, any of the 
packet streams produced by the hierarchical encoder 160 (of Fig. 9a) or any of the 
plurality 170 of video encoders (of Fig. 9) may be source encoded according to the 

25 robust source encoding scheme (JVT) and channel encoded (modulated) by the 

robust modulation scheme (4-VSB and/or 2-VSB), as described in more detail above. 
The corresponding demodulation and decoding of that packet stream takes place in 
the receiver of Fig. 1 0. Also, the lowest quality video signal is advanced the most, 
and consequently has the highest fade resistance. Further, the lowest video quality 

30 signal may be encoded with the least number of bits and thus takes a small amount 
of time to transmit. As the video quality of the video signal carried by packet streams 
increases, the time by which that packet stream is advanced decreases, 
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consequently the fade resistance decreases. Thus, when the channel characteristic 
has no fades, then the packet stream(s) carrying the highest video quality signal 
remain(s) available. Mild fades leave packet stream(s) carrying lower video quality 
signals available, and severe fades leave only the packet stream carrying the lowest 
5 quality video signal available. This gradual reduction in video quality as channel 
characteristics degrade is a desirable characteristic for a viewer. 

As described above, and illustrated in Fig. 1 and Fig. 9b, the same content 
representative signal may be staggercasted as a packet stream carrying a high 
quality video signal and as one or more packet streams carrying reduced video 

10 quality video signals. In such a communications system, it is, therefore, possible for 
some receivers, for example, a television receiver in a cellular phone or personal 
digital assistant (PDA), to extract and decode only a reduced quality content 
representative signal. In such a receiver, the display device is lower resolution and 
may only be able to display a reduced quality video signal. Further, the use of battery 

15 power makes it advantageous to minimize the amount of data processed. Both of 
these considerations suggest that such receivers decode only the packet stream 
carrying a video signal of appropriate video quality and display that image. 

Fig. 10c illustrates a receiver. In Fig. 10c, the input terminal 205 is coupled to 
the input terminal of the demodulator 207. An output terminal of the demodulator 207 
20 is coupled to the input terminal of the demultiplexer 21 0 f An output terminal of the 
demultiplexer 21 0 is coupled to an input terminal of a decoder 270. An output 
terminal of the decoder is coupled to the output terminal 215. 

In operation, the demodulator 207 demodulates the received composite signal 
in the appropriate manner, as described in more detail above. The demultiplexer 210 

25 selects only a single packet stream having a video signal of the desired quality. For 
example, this may be a QCIF format video signal, such as produced by the QCIF 
encoder 170(0) of Fig. 9b and carried on packet stream (0). The packet stream (0) is 
extracted by the demultiplexer 210 and is decoded by the decoder 270 to produce the 
QCIF format video signal. Such a receiver need only receive the table illustrated in 

30 Fig. 8 to determine the PID of the desired lower quality video signal packet stream 
(0). From the resolution data 812 transmitted in the table, the mobile receiver is able 
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to select the packet stream carrying the reduced quality video signal desired for 
processing. 

The communications system illustrated in Fig. 9 and Fig. 10 may be further 
extended. In the systems described above, video information carried in additional 
5 packet streams, may be used to provide graceful degradation under worsening 
channel conditions. However, such systems may also transmit additional video 
information which can enhance the quality of video signals under good channel 
conditions. By including a packet stream carrying augmented video information, in 
addition to the packet stream carrying the normal video signal, an augmented video 
10 image may be transmitted. 

Fig. 11 is a block diagram of a portion of a transmitter for transmitting a dual 
interlaced video signal and Fig. 12 is a block diagram of a portion of a receiver for 
receiving a dual interlaced video signal. Fig. 13 is a display diagram useful in 
understanding the operation of the dual interlace transmitter illustrated in Fig. 1 1 and 
15 the dual interlace receiver illustrated in Fig. 12. In Fig. 1 1 , those elements which are 
the same as those illustrated in Fig. 1 are designated by the same reference number 
and are not described in detail below. In Fig. 12, those elements which are the same 
as those illustrated in Fig. 6 are designated by the same reference number and are 
not described in detail below. 

20 Referring to Fig. 1 3, a content source produces a progressive scan video 

display, illustrated schematically at the top of Fig. 13 as a sequence of video lines 
1310 within a display border 1320. A normal HD video image picture includes 1080 
lines. Such an HD video image is transmitted at a rate of 30 frames per second in 
interlaced format. That is, an interlacer generates two fields: a first field including 

25 only odd numbered lines and a second field including only even numbered lines. 
These fields are transmitted successively at a rate of 60 fields per second. 

In Fig. 1 1 , the input terminal 105 is coupled to a dual output interlacer 1 02. A 
first output terminal of the dual output interlacer 102 is coupled to the input terminal of 
the robust mode encoder 1 10. A second output terminal of the dual output interlacer 
30 1 02 is coupled to the input terminal of the normal mode encoder 120. 
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Referring again to Fig. 13, the frame display image 1330(A) corresponds to 
the video signal A produced at the first output terminal of the dual output interlacer 
102 and the frame display image 1330(B) corresponds to the video signal B produced 
at the second output terminal of the dual output interlacer 102. In the frame display 
5 images 1330(A) and 1330(B), solid lines are transmitted In one field, and dotted lines 
are transmitted in the following field. In the frame display image in 1330(A) solid lines 
are odd lines and dotted lines are even lines; and in the frame display image 1330(B), 
solid lines are even lines and dotted lines are odd lines. This is illustrated in more 
detail in the field display images 1340(A), 1340(B), 1350(A) and 1350(B) beneath the 
10 frame display images 1330 (A) and 1330(B). In field 1, video signal A transmits the 
odd lines as illustrated in field display image 1340(A), and video signal B transmits 
the even lines, as illustrated in field display image 1340(B). In field 2, the video signal 
A transmits the even lines as illustrated in field display image 1350(B) and the video 
signal B transmits the odd lines as illustrated in field display image 1350(B). 

15 As described in more detail above, the video signal A is source encoded using 

JVT source encoding, then system encoded (packetized) by the robust mode 
encoder 110. The video signal B is source encoded using MPEG 2 source encoding, 
then system encoded (packetized) by the normal mode encoder. The modulator 
channel encodes (modulates) the robust mode packet stream using 4-VSB and/or 2- 

20 VSB modulation, and modulates the normal mode packet stream using 8-VSB 
modulation. 

In Fig. 12, a fiijst output terminal of the demultiplexer 210 is coupled to the 
input terminal of the normal mode decoder 240' and a second output terminal of the 
demultiplexer 210 is coupled to the input terminal of the delay device 220. The 

25 output terminal of the normal mode decoder 240' is coupled to a first signal input 
terminal of a dual input deinterlacer 202 and the output terminal of the robust mode 
decoder 240" is coupled to a second signal input terminal of the dual input 
deinterlacer 202. The error signal from the demultiplexer 210 is coupled to a control 
input terminal of the dual input deinterlacer 202. An output terminal of the dual input 

30 deinterlacer 202 is coupled to the output terminal 215. 
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As described in more detail above, the demodulator 207 channel decodes 
(demodulates) the robust mode packet stream using 4-VSB and/or 2-VSB 
demodulation and demodulates the normal mode packet stream using 8-VSB 
demodulation. The normal mode decoder 240' system decodes (depacketizes) and 
5 source decodes the normal mode packet stream using JVT decoding to reproduce 
the video signal B. The robust mode decoder 240" depacketizes and source 
decodes the robust mode packet stream using MPEG 2 decoding to reproduce the 
video signal A. 

The dual input deinterlacer 202 operates to combine the interlaced scan lines 
1 0 of the video signal A from the robust mode decoder 240" with the interlaced scan 
lines of the video signal B from the normal mode decoder 240' to produce a 
progressive scan field. For field 1, the odd scan lines from video signal A, illustrated 
in field display image 1 340(A), are combined with the even scan lines from video 
signal B, illustrated in field display image 1340(B). The resulting progressive scan 
1 5 field is illustrated in the field display image 1 345. For field 2, the even scan lines from 
video signal A, illustrated in field display image 1350(A), are combined with the odd 
scan lines from video signal B, illustrated in field display image 1350(B). The 
resulting progressive scan field is illustrated in the field display image 1355. Thus, a 
progressive scan field may be produced at the output terminal of the dual input 
20 deinterlacer 202 each field period. For an HD signal, this means that a full 1 080 line 
image is produced 60 times per second. 

The dual interlaced technique described above and illustrated in Fig. 1 1 , Fig. 
12 and Fig. 13 may also be combined with the techniques described above to 
provide a wider range of graceful degradation in the event channel conditions 

25 degrade. If channel conditions render one of the packet streams carrying video 
signals A or B unavailable, then the error signal E indicates this to the dual input 
deinterlacer 202. The dual input deinterlacer 202 begins producing the standard HD 
interlaced video signal from the available video signal. The display device (not 
shown), is reconfigured to display the image represented by the standard interlaced 

30 video signal until the other video signal becomes available again. If neither of the HD 
video signals are available, then the highest quality available video signal may be 
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displayed, as described in detail above with reference to the transmitter in Fig. 9 and 
the receiver in Fig. 10, 

The same technique may also be used to convert any interlaced format video 
signal, for example an SD video signal, to a progressive scan video signal at twice 
5 the frame rate. It is not necessary for the two video signals A and B to be 

staggercasted, as illustrated in Fig. 1 1 and Fig. 12. It is only necessary that they be 
simulcasted. However, staggercasting additionally provides graceful degradation in 
the presence of fade events, as described above. 

The communications system described above may be further extended to 
10 cooperate with a recording device, such as a digital personal video recorder (PVR). 
Such PVR devices are becoming included in digital television receivers due to the 
decreasing costs of such a device. In Fig. 9b, a PVR device 295 includes a video 
terminal (Vid) bidirectionally coupled to the selector 280, and a control terminal (Ctl) 
also bidirectionally coupled to the selector 280, as illustrated in phantom. The 
15 selector 280 is also coupled to a source of user control, also as illustrated in 
phantom. 

The selector 280 is configured to couple any desired video signal from the 
plurality 270 of video detectors to the PVR 295 independently of the input video 
signal coupled to the output terminal 215. The selector 280 may also be configured 
20 to couple an input video signal from the PVR 295 to the output terminal 21 5 for 

playback. The selector 280 may also supply control data to the PVR 295, and the 
PVR 295 supply status data to the selector 280 over the bidirectional control terminal. 

The PVR 295 may be controlled in several modes of operation. In one mode 
of operation, the best available video signal is coupled to the PVR 295 for recording. 

25 In this operational mode, the selector 280 couples the same input video signal to the 
PVR 295 as is coupled to the output terminal 215. This will result in the best quality 
video signal being recorded, but will take the most storage space, in the PVR 295. 
This will take advantage of the normal mode and robust mode packet streams 
carrying the video signal and the graceful degradation that provides. Alternatively, a 

30 lower resolution video signal may be coupled to the PVR 295 than is coupled to the 
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output terminal 215. For example, while the selector 280 may couple the best 
available video signal to the output terminal 215, the selector 280 may couple a video 
decoder 270 producing a lesser quality video signal to the PVR 295. This lesser 
quality video signal may be a selected one of the available video signals, such as the 
5 SD quality video signal from the SD decoder 270(n-1 ), with graceful degradation 

supplied by the lesser quality video decoders. Such a signal will require less storage 
space in the PVR 295 than the best available video signal. This will help to conserve 
storage space in the PVR 295, and allow for longer recording times. In the event that 
the selected lower quality video signal becomes unavailable, a higher quality signal 

10 may be recorded until the lower quality signal becomes available again. The 

selection of which lesser quality video to record (i.e. SD, or CIF or QCIF) may be 
directly selected by a viewer via the user input terminal. Alternatively, the selector 
280 may automatically control this selection according to some criterion. For 
example, a status signal from the PVR 295 can indicate the amount of storage 

1 5 remaining in the PVR 295. As the amount of storage remaining drops, the selector 
280 may automatically couple a video decoder 270 having reduced video quality to 
the PVR 295. Other criteria may be derived and used to control which video signal is 
coupled to the PVR 295 by the selector 280. 

Similarly, a user may desire to control the selection and display of the 
20 television programs being broadcast by a transmitter. In existing broadcasting 
systems, one of the transmitted packet streams carries a user program guide, 
containing information about all programs currently being broadcast and those due to 
be broadcast in the near future. From the program guide data, an image of a table 
listing all such programs, their channels and times may be generated by an on- 
25 screen display generator (OSD) 282 as illustrated in Fig. 10b. A user may control 
the display of the program guide information as an aid in finding a desired program 
and selecting that program to view using a user interface. The user interface displays 
images to present information to a viewer, requests input from a viewer and accepts 
viewer input from controls which may be incorporated in the receiver or in a remote 
30 control. Existing systems allow a viewer to request additional information about a 
program listing, such as a more detailed description of the program, a rating (G, PG, 
R, etc.), time duration, time remaining and so forth. 
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Additional information related to the staggercasting system described above 
may be added to the displayed program table, or the additional-information display. 
This information may be derived from the PSIP-VCT/PMT tables illustrated in Fig. 8. 
For example, additional indicators may be added to the displayed program table 
5 and/or additional-information display indicating that: this program is being 

staggercasted; what the video quality is of the video signals being staggercasted; 
what the audio quality of the audio signals being staggercasted; and so forth. By 
displaying this information for a viewer, the viewer is able to base selection of a 
program on it; More specifically, a viewer may select a program that is being 
10 staggercasted; or may select a program having video signal of a desired video 
quality, e.g. to match the display device to which the signal is being supplied. 

Current receivers also allow a viewer to set certain parameters. For example, 
a user may wish to automatically view all transmitted channels, or only channels to 
which the viewer is subscribed, or the subscribed channels plus pay-per-view 

15 channels, and so forth without having to manually change the on-screen-display each 
time it is displayed. A user interface presents a user with a screen image, via the 
OSD 282, on which this selection may be made using the user controls. An 
additional screen image may be produced, or an existing screen image modified, on 
which a viewer sets choices about selection and display of video signals which have 

20 been staggercasted, as described above. For example, a viewer may select to have 
the program table display only staggercasted programs, or to display staggercasted 
programs carrying video signals at or above a minimum video quality. 
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CLAIMS 



1 A method for storing staggercasted content, comprising the steps of: 
encoding a set of signals representing content, the set capable of being 

decoded to produce a corresponding set of decoded signals, each decoded signal 
5 having a quality different from the qualities of the decoded signals corresponding to 

the other encoded signals; 

generating a composite signal comprising the set of encoded signals 
staggered in time; 

extracting the set of encoded signals from the composite signal; 
1 0 detecting errors in the set of extracted encoded signals to produce a subset of 

available extracted encoded signals which are not erroneous; 

decoding a content representative signal at a selectable desired quality; and 
storing the decoded content representative signal in a storage device. 

2. The method of claim 1 wherein if a content representative signal at the 
15 desired quality is not available, decoding a content representative signal at a 

selectable desired quality from the subset of available extracted encoded signals and 
storing the decoded content representative signal. 

3. The method of claim 2 wherein the decoding step comprises the step of 
selecting the content representative signal at the desired quality automatically. 

20 4 - The method of claim 3 wherein the step of selecting the desired quality 

automatically comprises the step of selecting the desired quality in response to preset 
selection parameters. 

5. The method of claim 4 wherein the parameters are preset in response 
to user input. 
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6. The method of claim 1 wherein the step of selecting the desired quality 
automatically comprises the step of selecting the desired quality in response to the 
status of the storage device. 

7. The method of claim 6 wherein if the status of the storage device 

5 indicates that the storage device is nearly full, the desired quality is automatically a 
lower quality. 

8. The method of claim 1 wherein the decoding step comprises the step of 
selecting the desired quality in response to user input. 

9. The method of claim 8 wherein the step of selecting the desired quality 
10 in response to user input comprises the steps of: 

displaying an image representing information related to the encoded set of 
signals; and 

receiving user input after displaying the information display. 

10. The method of claim 9 wherein: 

1 5 the content representative signal represents a television program; 

the step of generating a composite signal comprises the step of further 
including a signal carrying information related to the television program comprising 
the respective qualities of the set of encoded content representative signal; and 
the step of displaying an image of the encoded signal representative 
20 information comprises listing the information related to the television program and the 
qualities of the set of encoded signals. 

11. The method of claim 10 wherein the television program information 
carrying signal carries data representing either or both of a program map table (PMT) 
and a program and information systems protocol-virtual channel table (PSIP-VCT). 

25 12. The method of claim 1 wherein the composite signal generating step 

comprises generating the set of encoded signals such that a lowest quality decoded 
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signal is undelayed, and the other encoded signals are delayed with respect to the 
encoded signal corresponding to the lowest quality decoded signal such that the 
higher the quality of the corresponding decoded signal, the longer the delay period. 

13. The method of claim 1 wherein the encoding step comprises the step of 
5 encoding at least one of the set of encoded signals using a technique which is robust 

relative to the encoding of the other encoded signals. 

14. The method of claim 13 wherein the at least one robust encoded signal 
comprises the encoded signal corresponding to the lowest quality decoded signal. 

15. The method of claim 13 wherein the set of encoded signals are channel 
10 encoded, and the robust encoded signals are channel encoded using a channel 

coding technique robust relative to the channel coding technique used for the non- 
robust encoded signals. 

1 6. The method of claim 1 5 wherein the channel coding for the robust 
encoded signals is one of 4-VSB or 2-VSB modulation and the channel coding for the 

15 non-robust encoded signals is 8-VSB modulation. 

17. A staggercasting receiver, for receiving a composite signal comprising a 
set of encoded signals, staggered in time, representing content, the set capable of 
being decoded to produce a corresponding set of decoded signals, each decoded 
signal having a quality different from the qualities of the decoded signals 

20 corresponding to the other encoded signals, the receiver comprising: 

a demultiplexer, responsive to the composite signal, for extracting the set of 
encoded signals, detecting errors in respective encoded signals, and producing a 
subset of available extracted signals which are not erroneous; 

a decoder, coupled to the demultiplexer and responsive to the error 
25 representative signal, for reproducing a content representative signal at a selectable 
desired quality; and 
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a storage device, coupled to the decoder, for storing the reproduced content 
representative signal. 

18. The receiver of claim 17 wherein the decoder comprises circuitry for 
reproducing a content representative signal at the highest quality from the subset of 

5 available extracted encoded signals. 

19. The receiver of claim 17 wherein the decoder comprises circuitry for 
reproducing a content representative signal at a selectable desired quality from the 
subset of available extracted encoded signals is a content representative signal at the 
desired quality is not available. 

10 20. The receiver of claim 1 9 wherein the decoder further comprises circuitry 

for automatically reproducing the content representative signal at the desired quality. 

21 . The receiver of claim 20 wherein the decoder further comprises circuitry 
for storing preset selection parameters, and for automatically reproducing the content 
representative signal at the desired quality in response to the selection parameters. 

15 22. The receiver of claim 1 7 wherein: 

the storage device generates a signal representing the status of the storage 
device; and 

the decoder comprises circuitry for automatically reproducing the content 
representative signal at the desired quality in response to the status representative 
20 signal. 

23. The receiver of claim 22 wherein the decoder automatically reproduces 
the content representative signal at a lower quality in response to the status 
representative signal indicating that the storage device is nearly full. 

24. The receiver of claim 21 wherein further comprises circuitry for storing 
25 selection parameters in response to user input. 
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25. The receiver of claim 17 wherein the decoder comprises circuitry for 
reproducing the content representative signal at the desired quality in response to 
user input 

26. The receiver of claim 25 further comprising an on-screen-display device 
5 for displaying an image representing information related to the encoded set of 

signals. 

27. The receiver of claim 26 wherein 

the content representative signal is a television program and the composite 
signal further comprises a signal carrying information related to the television 
10 program comprising the respective qualities of the set of encoded content 
representative signal; and 

the on-screen-display device displays a listing of the information related to the 
television program and the qualities of the set of encoded signals. 

28. The receiver of claim 27 wherein the television program information 

1 5 carrying signal carries data representing either or both of a program map table (PMT) 
and a program and information systems protocol-virtual channel table (PSIP-VCT). 

29. The receiver of claim 17 wherein at least one of the set of encoded 
signals is encoded using a technique which is robust relative to the encoding of the 
other signals, and the decoder comprises a decoder, responsive to the at least one 

20 encoded signal, for decoding the at least one encoded signal. 

30. The receiver of claim 29 wherein the at least one robust encoded signal 
comprises the encoded signal corresponding to the lowest quality decoded signal. 

31. The receiver of claim 30 wherein: 

the set of encoded signals are channel coded, and the robust encoded signals 
25 are channel encoded using one of 4-VSB or 2-VSB modulation and the other 
encoded signals are channel encoded using 8-VSB modulation; and 
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the decoder comprises a demodulator for channel decoding the robust 
encoded signals using one of 4-VSB or 2-VSB demodulation and channel decoding 
the other encoded signals using 8-VSB demodulation. 

32. The receiver of claim 17 wherein 

the composite signal comprises the set of encoded signals such that a lowest 
quality decoded signal is undelayed, and the other encoded signals are delayed with 
respect to the encoded signal corresponding to the lowest quality decoded signal 
such that the higher the quality of the corresponding decoded signal, the longer the 
delay period; and 

the receiver further comprising a plurality of delay circuits, coupled between 
the demultiplexer and the decoder and respectively responsive to the set of extracted 
encoded signals, for realigning the extracted encoded signals in time. 
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